Density functional theory calculation on many-cores hybrid central processing unit-graphic processing unit architectures.

Authors

  • Luigi Genovese
  • Matthieu Ospici
  • Thierry Deutsch
  • Jean-François Méhaut
  • Alexey Neelov
  • Stefan Goedecker
Abstract

We present the implementation of a full electronic structure calculation code on a hybrid parallel architecture with graphic processing units (GPUs). The implementation is built on a free software code based on Daubechies wavelets. This code shows very good performance, systematic convergence properties, and excellent efficiency on parallel computers. Our GPU-based acceleration fully preserves all of these properties. In particular, the code is able to run on many cores, which may or may not have an associated GPU, and thus on parallel and massively parallel hybrid machines. With double-precision calculations, we can achieve considerable speedups, ranging from a factor of 20 for some operations to a factor of 6 for the whole density functional theory code.

Similar resources

Density Functional Theory calculation on many-cores hybrid CPU-GPU architectures

Luigi Genovese, Matthieu Ospici, Thierry Deutsch, Jean-François Méhaut, Alexey Neelov, and Stefan Goedecker. European Synchrotron Radiation Facility, 6 rue Horowitz, BP 220, 38043 Grenoble, France; Université Joseph Fourier, Laboratoire d’Informatique de Grenoble, INRIA, Grenoble, France; Bull SAS, 1 rue de Provence, 38130 Echirolles, France; Laboratoire de Simulation Atomistique (L Sim), SP2M...

Full text

A Novel Multiply-Accumulator Unit Bus Encoding Architecture for Image Processing Applications

In CMOS circuits, power dissipation is a major concern for VLSI functional units. With shrinking feature size, increased frequency and power dissipation on the data bus have become the most important factors compared to other parts of the functional units. One of the most important functional units in any processor is the Multiply-Accumulator unit (MAC). The current work focuses on the develop...

Full text

Implementing molecular dynamics on hybrid high performance computers - short range forces

The use of accelerators such as graphics processing units (GPUs) has become popular in scientific computing applications due to their low cost, impressive floating-point capabilities, high memory bandwidth, and low electrical power requirements. Hybrid high-performance computers, machines with more than one type of floating-point processor, are now becoming more prevalent due to these advantages...

Full text

Performance Analysis of FEM Algorithms on GPU and Many-Core Architectures

The roadmaps of the leading supercomputer manufacturers are based on hybrid systems, which consist of a mix of conventional processors and accelerators. This trend is mainly due to the fact that the power consumption cost of future CPU-only Exascale systems will be unsustainable; thus, accelerators such as graphic processing units (GPUs) and many-integrated-core (MIC) will likely be the inte...

Full text

An implementation of level set based topology optimization using GPU

This work presents the implementation of a topology optimization approach based on the level set method on massively parallel computer architectures, in particular on a Graphics Processing Unit (GPU). Such architectures have become popular in recent years for complex and tedious scientific computation. They are composed of dozens, hundreds, or even thousands of cores specially des...

Full text

Journal:
  • The Journal of Chemical Physics

Volume 131, Issue 3

Pages: -

Publication date: 2009